Goto

Collaborating Authors

 run machine


Why and how to run machine learning algorithms on edge devices

#artificialintelligence

Intel's Neural Compute Stick 2 is an example of machine learning hardware for edge devices. Analyzing large amounts of data based on complex machine learning algorithms requires significant computational capabilities. Therefore, much processing of data takes place in on-premises data centers or cloud-based infrastructure. However, with the arrival of powerful, low-energy consumption Internet of Things devices, computations can now be executed on edge devices such as robots themselves. This has given rise to the era of deploying advanced machine learning methods such as convolutional neural networks, or CNNs, at the edges of the network for "edge-based" ML.


How to do Gesture identification through machine learning on Arduino

#artificialintelligence

In this Arduno Machine learning project we're going to use an accelerometer sensor to identify the gestures you play. This is a remake of the project found on the Tensorflow blog. We're going to use a lot less powerful chip in this tutorial, tough: an Arduino Nano (old generation), equipped with 32 kb of flash and only 2 kb of RAM. We're going to use the accelerations along the 3 axis (X, Y, Z) coming from an IMU to infer which gesture we're playing. We'll use a fixed number of recordings (NUM_SAMPLES) starting from the first detection of movement.


How to run machine learning at scale -- without going broke

#artificialintelligence

Machine learning is computationally expensive -- and because serving real-time predictions means running your ML models in the cloud, that computational expense translates into real dollars. Put another way, if you wanted to add a translation feature to your app that automatically translated text to your user's preferred language, you would deploy an NLP model as a web API for your app to consume. To host this API, you would need to deploy it through a cloud provider like AWS, put it behind a load balancer, and implement some kind of autoscaling functionality (probably involving Docker and Kubernetes). None of the above is free, and if you're dealing with a large amount of traffic, the total cost can get out of hand. This is especially true if you aren't optimizing your spend.


Neural Magic gets $15M seed to run machine learning models on commodity CPUs โ€“ TechCrunch

#artificialintelligence

Neural Magic, a startup founded by a couple of MIT professors, who figured out a way to run machine learning models on commodity CPUs, announced a $15 million seed investment today. Comcast Ventures led the round, with participation from NEA, Andreessen Horowitz, Pillar VC and Amdocs. The company had previously received a $5 million pre-seed, making the total raised so far $20 million. The company also announced early access to its first product, an inference engine that data scientists can run on computers running CPUs, rather than specialized chips like GPUs or TPUs. That means that it could greatly reduce the cost associated with machine learning projects by allowing data scientists to use commodity hardware.


Tensor Processing Units were purpose-built for machine learning: Pros, cons

#artificialintelligence

Google said its own requirements drove the development of TPUs -- both the company's earlier first-generation TPU as well as the second-generation TPU that was announced in May 2017. "While our first TPU was designed to run machine learning models quickly and efficiently -- to translate a set of sentences or choose the next move in [the board game] Go -- those models still had to be trained separately. Training a machine learning model is even more difficult than running it, and days or weeks of computation on the best available CPUs and GPUs are commonly required to reach state-of-the-art levels of accuracy," Google stated in a May 17, 2017, blog. Although its research and engineering teams have made "great progress" in scaling the difficult task of training machine learning models using readily-available hardware, the blog post continued, the first-generation TPU "wasn't enough to meet our machine learning needs." Google's new machine learning system was built to eliminate bottlenecks and maximize overall performance, using second-generation TPUs to both train and run machine learning models, the company touted.


Build and train machine learning models on our new Google Cloud TPUs

@machinelearnbot

We're excited to announce that our second-generation Tensor Processing Units (TPUs) are coming to Google Cloud to accelerate a wide range of machine learning workloads, including both training and inference. We call them Cloud TPUs, and they will initially be available via Google Compute Engine. We've witnessed extraordinary advances in machine learning (ML) over the past few years. Neural networks have dramatically improved the quality of Google Translate, played a key role in ranking Google Search results and made it more convenient to find the photos you want with Google Photos. Machine learning allowed DeepMind's AlphaGo program to defeat Lee Sedol, one of the world's top Go players, and also made it possible for software to generate natural-looking sketches.